Network cross-validation by edge sampling
نویسندگان
چکیده
Many models and methods are now available for network analysis, but model selection and tuning remain challenging. Cross-validation is a useful general tool for these tasks in many settings, but is not directly applicable to networks since splitting network nodes into groups requires deleting edges and destroys some of the network structure. Here we propose a new network cross-validation strategy based on splitting edges rather than nodes, which avoids losing information and is applicable to a wide range of network problems. We provide a theoretical justification for our method in a general setting, and in particular show that the method has good asymptotic properties under the stochastic block model. Numerical results on both simulated and real networks show that our approach performs well for a number of model selection and parameter tuning tasks.
منابع مشابه
Prediction and early diagnosis of complex diseases by edge-network
MOTIVATION In this article, we develop a novel edge-based network i.e. edge-network, to detect early signals of diseases by identifying the corresponding edge-biomarkers with their dynamical network biomarker score from dynamical network biomarkers. Specifically, we derive an edge-network based on the second-order statistics representation of gene expression profiles, which is able to accuratel...
متن کاملBayesian Model Assessment and Comparison Using Cross-Validation Predictive Densities
In this work, we discuss practical methods for the assessment, comparison, and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimate, it is important to obtain the distribution of the expected utility estimate because it describ...
متن کاملQSPR study of supercooled liquid vapour pressures of polybrominated diphenyl ethers using the molecular distance–edge vector index
The quantitative structure property relationship (QSPR) for supercooled liquid vapour pressures (pL) of polybrominated diphenyl ethers (PBDEs) was investigated. The molecular distance–edge vector (MDEV) index was used as the structural descriptor. The quantitative relationship between the MDEV index and log pL was modelled by multivariate linear regression (MLR) and an artificial neural network...
متن کاملUsing an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks
Social networks are streaming, diverse and include a wide range of edges so that continuously evolves over time and formed by the activities among users (such as tweets, emails, etc.), where each activity among its users, adds an edge to the network graph. Despite their popularities, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...
متن کاملTheory and Methodology Arti®cial neural networks in bankruptcy prediction: General framework and cross-validation analysis
In this paper, we present a general framework for understanding the role of arti®cial neural networks (ANNs) in bankruptcy prediction. We give a comprehensive review of neural network applications in this area and illustrate the link between neural networks and traditional Bayesian classi®cation theory. The method of cross-validation is used to examine the between-sample variation of neural net...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017